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Abstract. Recent authors have proposed analyzing conditional reason- 
ing through a notion of intervention on a simulation program, and have 
found a sound and complete axiomatization of the logic of conditionals 
in this setting. Here we extend this setting to the case of probabilistic 
simulation models. We give a natural definition of probability on formu- 
las of the conditional language, allowing for the expression of counter- 
factuals, and prove foundational results about this definition. We also 
find an axiomatization for reasoning about linear inequalities involving 
probabilities in this setting. We prove soundness, completeness, and NP- 
completeness of the satisfiability problem for this logic. 


Keywords: Counterfactuals - conditional reasoning - probabilistic pro- 
grams - conditional simulation. 


1 Introduction 


Accounts of subjunctive conditionals based on internal causal models offer an 
alternative to approaches based on ranking possible worlds by similarity [9]. One 
might, e.g., employ structural equation models (SEMs), i.e. systems of equations 
connecting the values of relevant variables, as the causal model; the semantics of 
conditionals are then based on a precise notion of intervention on the SEM [11]. 
Recently, some authors [8,10,4,3,1] have proposed using arbitrary programs, 
rather than systems of equations, as causal models. This approach emphasizes 
the procedural nature of many internal causal simulations over the purely declar- 
ative SEMs. 

It is possible to define precisely this idea of programs as causal models and 
to generalize the idea of intervention from SEMs to programs [8]. It is also 
possible to give a sound and complete logic of conditionals in this setting [6]. 
However, these preliminary results have not fully explored the very important 
case—from, e.g., the Bayesian Logic modeling language [10] and implicit in the 
use of probabilistic programs as cognitive models [3]—of conditionals in a prob- 
abilistic setting, via using stochastic programs as the underlying causal model. 


* Thanks to Thomas Icard for helpful discussions. The author was supported by the 
Sudhakar and Sumithra Ravi Family Graduate Fellowship in the School of Engineer- 
ing at Stanford University for this work. 
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In the present contribution we will establish foundational definitions and logical 
results for this setting, thus extending the causal simulation framework to prob- 
abilistic simulation programs. Probabilities over a causal modeling language are 
defined and results showing that they may actually be interpreted as probabili- 
ties are given. The probabilities are used to give the semantics of a language for 
probabilistic reasoning, for which an axiomatization is given. The language and 
axiomatization are extensions of an analogous probabilistic language considered 
for the purely propositional case by [2]. Soundness and completeness of the axiom 
system is proven, and the satisfiability problem is found to be NP-complete. 


2 Probabilistic Simulation Models and the Logical 
Language 


2.1 Simulation Models 


We work toward the definition of a language £ for expressing probabilities involv- 
ing probabilistic simulation models. Probabilistic simulation models extend the 
non-probabilistic! causal simulation models of [8, 6]. Formally, a non-probabilistic 
simulation model is a Turing machine”, and a probabilistic simulation model is 
a probabilistic Turing machine, i.e., a deterministic Turing machine (that of 
course still has a read-write memory tape) given read access to a random bit 
tape whose squares represent the results of independent fair coin flips. The use 
of Turing machines is meant to allow for complete generality and encompasses, 
e.g., both logic programming and imperative programming. We sometimes use 
intuitive pseudocode in describing simulation models; such pseudocode is readily 
convertible to Turing machine code. 

We suppose that simulation models are run initially from an empty tape.’ 
As a simulation model runs, it reads and writes the values of binary variables on 
its tape squares. Eventually, the model either halts with some resultant tape, or 
does not halt, depending on the results of the coin flips the model performs in 
the course of its simulation. Every probabilistic simulation model thus induces 
a distribution on these possible outcomes. We are interested not only in these 
outcomes, but also in the dynamics and counterfactual information embodied in 
the model. That is, we are interested in what would happen were we to hold the 


1 The use of “non-probabilistic” rather than “deterministic” is intended to prevent 
confusion of the probabilistic/non-probabilistic distinction with the deterministic 
Turing machine/non-deterministic Turing machine distinction. The former distinc- 
tion is about the presence of a source of randomness while the latter is about the 
number of possible halting executions. 

[6] does not require these machines to be deterministic, and isolates an additional 
logical principle that is valid when the machines are deterministic. However here 
we will suppose “non-probabilistic simulation model” always refers to one whose 
Turing machine is deterministic. This definition is more useful for comparison with 
the probabilistic case, in which all underlying machines are deterministic. 

[6] also includes an initial input tape in the definition of the model. This difference 
is inconsequential. 


w 


Causal Modeling with Probabilistic Simulation Models 


values of the tape square variables fixed in a particular way that counterfactu- 
ally differs from the actual values the squares take on—in the distribution over 
outcomes that results under a particular intervention: 


Definition 1 (Intervention [8]). Let S be a specification of binary values for 
a finite number of tape squares: S = {a;}ier for a finite index set I CN. Then 
the intervention Ts is a computable function from Turing machines to Turing 
machines specified in the following way. Given a machine T, the intervened ma- 
chine Ts(T) does the same thing as T but holds the variables in S to their fixed 
values specified by S throughout the run. That is, Ls(T) first writes x; to square 
i for alli € I, then runs T while ignoring any writes to any of the squares whose 
indices are in I. 


Suppose one fixes the entire random bit tape to some particular sequence in 
{0,1}°°. Then the counterfactual, as well as actual, behavior of a probabilistic 
simulation model is completely non-probabilistic. We define first a basic language 
that allows us to express facts about such behavior. Then we will define the 
probability that a given probabilistic simulation model satisfies a formula of this 
basic language. Our final language £ uses these probabilities—it thus expresses 
facts about the probabilities that counterfactual properties hold. In all logical 
expressions we help ourselves to these standard notational conventions: a > 8 
abbreviates ~a V 6, and a + 8 denotes (a > 8) A (8 > a). 


2.2 The Basic Language 


Syntax The basic, non-probabilistic language Lnon-prob is a propositional lan- 
guage over conditionals. Formally: 


Definition 2. Let X be a set of atoms {X1, X2, X3,...} representing the values 
of the memory tape variables and let Lprop be the propositional language formed 
by closing X off under conjunction, disjunction, and negation. 

Let the intervention specification language Lint C Lprop be the language of 
purely conjunctive, ordered formulas of unique literals,* i.e., formulas of the form 
li, Ate- Ali, for some n > 0, where ij < ij+1 and each l;, is either Xi, or 7Xi,. 
T abbreviates the “empty intervention” formula with n = 0. Let Leona be the 
conditional language of formulas of the form (a)8 for a € Lint, B E Lprop- 

The overall basic language Lnon-prob 18 the language formed by closing off the 
formulas of Leona > under conjunction, disjunction, and negation. 


Every formula a € Lins specifies an intervention Za by giving a list of variables 
to fix and which values they are to be fixed to. Given a subjunctive conditional 


4 The point being that such formulas are in one-to-one correspondence with specifi- 
cations of interventions, i.e., finite lists of variables along with the values each is to 
be held fixed to. 

5 Unlike [6], we do not admit the basic atoms X as atoms of £. There is no diffi- 
culty extending the semantics to such atoms, but allowing them would needlessly 
complicate the proof of Theorem 1. 
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formula (a) € Leona, we call a the antecedent and 8 the consequent. We use 
[a] for the dual of (a), i.e., [a]S abbreviates 4(a)(=G). Note that ()y holds in a 
program if the unmodified program halts with a tape making y true. 


Semantics The semantics of the basic language are defined from considering a 
subjunctive conditional to be true in a simulation model when the program so 
intervened upon as to make its antecedent hold halts with such values of the tape 
variables as make its consequent hold. For example, consider a simple model that 
checks if the first memory tape square Xo is 1 and if so writes a 1 into the second 
tape square X,, and otherwise simply halts. This program satisfies the formulas 
()aXo, () 7X1, but also the counterfactual formula (Xo)(Xo A X1): holding the 
first memory square fixed to 1 causes a write of the value 1 into the second tape 
square, thus satisfying the consequent Xo A X1. Formally: 


Definition 3. Let T be a non-probabilistic simulation model. Define T -non-prob 
(a) 6 iff Za(T) halts with a memory tape whose variable assignment satisfies B. 
Now suppose T is probabilistic, and fix values for all squares on the random 
bit tape to some sequence r € {0,1}. Define T,r H (ab iff Za(T) when run 
with its random bit tape fixed to r halts with a resultant memory tape satisfying 
B. Define (in both cases) satisfaction of arbitrary formulas of Lyon-prob in the 
familiar way by recursion. 


In a sense, the validities of the non-probabilistic setting carry over to this setting, 
as we will now show. For Y E€ Lyon-prob, Write Enon-prob ¥ if y is valid in the class 
of all non-probabilistic simulation models. We will see that all such formulas are 
still valid for probabilistic simulation models, under Definition 3, once one fixes 
the random bit tape to a particular sequence. 


Lemma 1. Fpon-prob Y if and only if, for all probabilistic simulation models T 
and all r € {0,1}%, we have that T,r = y. 


Proof. Suppose Ffnon-prob Y. Consider some probabilistic simulation model T 
and sequence r € {0,1}. p is composed of Leona-atoms, of the form (a)/. 
What is the behavior of Z,(T),r? Either Z,(T),r reads only a finite portion 
of r or reads an unbounded portion of r (in the latter case, it also does not 
halt). If only a finite portion is read, let N(a) be the maximal random bit tape 
square reached of r. Let N be the maximum of the N(a) for all atoms a in y, 
clearly existent as y has finite length. Construct a Turing machine T’ from T 
that embeds the contents of r up to index N into its code, replacing any read 
from r with its value. This is possible in a finite amount of code as we only have 
to include values up to N in T’. 

What if Z,(T),r ends up reading an unbounded portion of r? We note that 
it is possible to write code in T’ to check if the machine is being run under an 
a-fixing intervention—i.e., conditional code that runs under Z,,(T’) and no other 
intervention. Add such code to T’, including an infinite loop conditional on an 


6 For the precise details of this construction, see [6]. Briefly, if one wants to check if 
some X; is being held fixed by an intervention, one can try to toggle X;; this attempt 
will be successful iff X; is not currently being fixed by an intervention. 
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a-intervention for each case where Z,(T),r reads an unbounded portion of r. 
Now, for all atoms (a) 8, T’ Fnon-prob (a) 6 iff T,r H (a) 3. As this holds for any 
atom of p, and Fnon-prob Y, we have that T,r = ¢ as desired. 

Now, suppose that T,r } y for all probabilistic T,r. We want to see that 
Hnon-prob Y. Given a non-probabilistic T, convert T to a probabilistic TM T’ that 
never reads from its random tape, and take any random tape r. Then T’,r = y 
so that T Fnon-prob Y- 


2.3 Adding Probabilities 


Syntax Z£ is the language of linear inequalities over probabilities that formulas 
of Lnon-prob hold. More precisely: 


Definition 4. Let Lineg be the language of formulas of the form 
aP(y1) + +++ + anP(pn) < c (1) 


for some n € N, and c,ay,...,4n E Z, 1,..-, Pn E Lnon-prob- Then L is the 
language of propositional formulas formed by closing off Lineq under conjunction, 
disjunction, and negation. 


We sometimes write inequalities of a different form from (1) with the under- 
standing that they can be readily converted into some £-formula. For example, 
an inequality with a > sign is a negation of a Lincg-formula. 


Semantics Let T be a probabilistic simulation model. We will shortly define a 
probability Pr : Lnon-prob — [0,1]. Now suppose a given y € Lineq has the form 
(1). Then T — ọ iff the inequality (1) holds when each P(y;) factor takes the 
value Pt(y;). Satisfaction T = y for arbitrary p € £ is then defined familiarly 
by recursion. Given Y E€ Lyon-prob, the probability P7(y) is simply the (standard) 
measure of the set of infinite bit sequences r for which T,r = p. More formally: 
let X be the o-algebra on {0,1} generated by cylinder sets and u be the 
standard measure defined on X.” Now let S(p) = {r € {0,1}% : T,r = p}. 
Then we define Pr(y) = w(S(y)). The following Lemma ensures that S(y) is 
always measurable, so that this definition is valid. 


Lemma 2. For any p E€ Lnon-prob, we have S(p) E€ X. 


Proof. Proof by induction on the structure of y. If y = ~y, then S(p) is the 
complement of a set in X and hence is in X. The case of a conjunction or 
disjunction is similar since X is closed under intersection and union. The base 
case is that of the atoms. Consider an atom of the form (a). If Z,(T) halts on 
x with random bit tape fixed to r, then it does so reading only a finite portion 
of r. Thus S((qa)£) is the union of cylinder sets extending finite strings on which 
T.(T) halts with a result satisfying 8, and hence is in X. 


T That is, as the product measure of Bernoulli(1/2) measures, as defined in, e.g., [3]. 


D. Ibeling 


This probability is coherent in the sense that it plays well with the logic of the 
basic language: 


Proposition 1. For any probabilistic T we have, 


1. Pr (vy) =1 if FFnon-prob Y for pe Lnon-prob 
2. Pr(y) < Pr(w) whenever Hnon-prob Y > Y for p,p E Lnon-prob 
3. Prp) T Pr(y ^ v) + Pr(y A =p) for all P, p = Lyon-prob 


Proof. (1) holds since in this case, by Lemma 1, S(p) = {0,1}°. (2) holds 
since in this case, S(y) C S(q). Finally (3) holds by noting Fnon-prob Y + 


((pAW)V (p An7w)), applying (2), and noting that S(p A w) and S(p A 7w) are 
disjoint. 


A corollary of part (2) is that logical equivalents under Enon-prob preserve prob- 
ability. 


2.4 The Case of Almost-Surely Halting Simulations 


An interesting special case is that of the simulation models that halt almost- 
surely, i.e., with probability 1 under every intervention. Call this class M+. Fol- 
lowing the urging of [7] we have not restricted the definition of probabilistic sim- 
ulation model to such models. We will see that from a logical point of view, this 
case is a natural probabilistic analogue of the class Me Sa of non-probabilistic 
simulation models that halt under every intervention. By this we mean that we 


may prove an analogue to Lemma 1. Write = nonepiob Y if p E€ Lyon-prob is valid 
in Prk a Note that Lemma 1 does not hold if one merely changes all the 


preconditions to be halting/almost-surely halting: consider a probabilistic sim- 
ulation model T that repeatedly reads random bits and halts at the first 1 it 
discovers; this program is almost-surely halting. But if r is an infinite sequence 
of Os, then T,r  ()T, even though ares ()T. Crucially, we must move to 
the perspective of probability and measure to see the analogy: 


Lemma 3. eee Y if and only if, for all T € M}, we have T,r = y for all 


r € {0,1}© except on a set of measure 0. 


Proof. Suppose PE etoi y. We claim that for all T € M+ we have T,r | y for 


all r except on a set of measure 0. Again, consider an atom (a)( appearing in y. 
The set of r for which Za(T), r does not halt has measure 0, given that T € Mt. 
On each such r, the run of Z,(T), r must read infinitely many bits of r: otherwise, 
the intervened machine would have a nonzero probability of not halting. Thus, 
excluding such r, it is possible to repeat the construction of T’ from the proof 
of Lemma 1 for (œ), and in doing this construction we are already ignoring all 
cases where an unbounded portion of r is read. This means that we do not have 
to include any infinite loops in T’, and T’ will be always-halting. If we exclude 
all the such r arising from all antecedents of atoms of y, then we only exclude 
a set of measure 0 since there are finitely many atoms. Except for such r, the 
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construction works, and T’ has, as before, the same behavior as T. But since 


+} 
M non-prob P, 


For the opposite direction, let T € M 


we have that T,r | ọ except on the excluded set of measure 0. 


4 


non-prob' 


We wish to show that T 


M non-prob 


y. Convert T to an identical probabilistic simulation program T’ that never reads 
from its random tape. We have T’,r = ¢ for all r but on a set of measure 0; in 


particular, for at least one r. This implies T }non-prob Y- 


3 Axiomatic Syst 


ems 


We will now give an axiomatic system for reasoning in £ and prove that it is 
sound and complete with respect to probabilistic simulation models: it proves 
all (completeness) and only (soundness) the formulas of £ that hold for all prob- 
abilistic simulation models. We will give an additional system that is sound 
and complete for validities with respect to the almost-surely halting simulation 


models M+. 


Definition 5. Let AX be a set of rules and axioms formed by combining the 


following three modules. 


1. PC: propositional reasoning (tautologies and modus ponens) over atoms of 


L. 
2. Prob: the following axioms: 
NonNeg. 
Norm. 
Add. 
Dist. 


P(p) = P(Y) whenever 


(p ^Y) = Pv) 


P 
) F-non-prob Y > yp 


3. Ineq, an axiomatization (see [2]) for reasoning about linear inequalities: 


Zero. 


Permutation. 


h 


Add!Ineq. 


Mult. 


Dichotomy. 


an 


Mono. P(p 


WN 


an 


P(Yn) < ©) 
anP(pn) + OP(Yn+41) < ©) 


P(pn) < c) S (a Ploj) + + an P(n) < ©) 


1;---; Jn are a permutation of 1,...,n 

) + + anP(pn) < c) A (aPC) + + an Pn) < e) 
+ a4 )P(y1) +: + (an + @,,)P(Yn) < (e+ €')) 

) an P(Yn) < c) 


- + banP(Yn) < bc) for any b > 0 


P(pn) < ©) V (aiP(y1) + +++ + anP(pn) 2 ©) 


P(Yn) < ©) 


=> (aP(p1) +--+ +anP(pn) <b) ifb >c 
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Additionally, let AX! be the system formed in exactly the same way, but replacing 
+ 


Fnon-prob 7 


IM non-prob with 


Note that the non-probabilistic validities Fyon-prob and omnes appearing in 


Dist, have been completely axiomatized in [6]. The main result is: 


Theorem 1. AX (respectively, AX‘) is sound and complete for the validities of 
L with respect to M (respectively, M+). 


Proof. Soundness (of Prob) follows from Lemma 1, Proposition 1, and, for the 
almost-surely halting case, Lemma 3. For completeness, consider the general case 
of M first. As usual, it suffices to show that any consistent y € £ is satisfiable by 
some probabilistic simulation model. We put y into a normal form from which we 
construct a canonical model. By PC we may suppose y is in disjunctive normal 
form. We may further suppose that it is a conjunction of Lineq-literals, as at 
least one (conjunctive) clause in the disjunctive normal form must be consistent. 
Let a1,...,@n E Leona be the atoms that appear inside any probability P in y, 
and let 61,...,69» represent all the formulas of the form lı A--- Al, that can 
be obtained by setting each l; to either a; or —a;. We then have the following, 
which is a kind of normal form result: 


Lemma 4 (Lemma 2.3, [2]). y is provably-in-AX equivalent to a conjunction 


(P(d1) > 0) A+++ A (P(b2n) > O)A 
(P(6:) +--+ P(dan) =1)A 
(a1,1P(d1) +--+ + @1,2»P(dan) < e1)A 
OA 
(am 1P(ô1) + +++ + @m,anP(d2n) < Cm)A 
(a4, P(d1) +++ + ah gnP(dan) > A 
PAN 

(tg g(r) +--+ aly gn (an) > ey) (2) 


for some integer coefficients cy, ... , Cm, C1) +++ 1 Cnty 1,15 ++ +1 Gm,2"1 AL 19 -+- Am gn- 


Proof. Let Y E€ Lnon-prob be any of the formulas appearing inside of a probability 
P in y. Note that P(#) = P(Y A hL) + P(Y Aal) by Add. Moving on to ls, 
we have, provably, Pw) Ah) = P(Y Al Al) + P(Y ALl A mlz), and we may 
rewrite P(Y A -l,) similarly. Applying this process successively, we have P(Y) = 
P(wWA61)+---+P(w Adon). For any term in the right-hand side of this inequality, 
if Y = ĝi, propositional reasoning by Dist allows us to replace the term by P(6,), 
and if not, by 0. Thus we always have that P(Y) = 6, P(6,) +--+ + bgnP(d92) 
for some coefficients b;. Applying this process to each P-term in y and using 
Ineq to rewrite the left-hand sides of the inequalities, and conjoining the (clearly 
provable) clauses that P(6;) > 0 for all 1 < i < 2”, and P(é,) +---+P(d9n) = 1, 
we obtain (2). 
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The conjunction (2) can be seen as a system of simultaneous inequalities over 
2” unknowns, P(d;),...,P(dgn). Ineq is actually sound and complete for such 
systems (we refer the reader to Section 4 of [2] for the proof of this fact). So if y 
is consistent with AX—which includes Ineq—this system must have a solution. 
Thus there are values P(d;) solving (2). We will now construct a probabilistic 
simulation model having precisely these probabilities of satisfying each 6;. Note 
that for any 6; with Enon-prob L + 6; it is provable that P(d;) = 0, and we may 
conjoin this to (2). Note also that 6; A 6; is unsatisfiable for any i # j. Given 
these two observations, the following Lemma implies the result. 


Lemma 5. For any collection of satisfiable Lyon-prop-formulas ~1,..., Pn no 
two of which are jointly satisfiable, and any rational probabilities p1,...,P~n > 0 
such that py +--+ pn = 1, there is a probabilistic simulation model T such that 
Pr(yi) = pi for alli, 1<i<n. 


Proof. Since the y; are satisfiable, there are non-probabilistic simulation models 
Thon-prob,1) +++ Tnon-prob,n Such that for alli = 1,...,, we have Thon-prob,i Frnon-prob 
pi. Further, we may suppose the machines so constructed use only a bounded 
number of memory tape squares. Thus let the maximum index of a tape square 
used by any of the Tnon-prob be N. We now describe T informally. Suppose 
without loss of generality that for all 7, p; = a;/b for some common denominator 
b. Let T draw a random number r from 1 up to b uniformly, and ensure that 
T does any auxiliary computations it might need only on squares with indices 
at least N + 1. Check whether r < aj, and if so, let T branch into the code of 
Thon-prob,1- If not, check if aj +1 < r < a; +a2 and if so, branch into Tyon-prob,2- 
Repeat the process for p3,...,Pn- It’s clear that the probability of branching 
into each Tyon-prob,i block is exactly pi, and the same is true under any rele- 
vant (i.e., involving only memory tape variables that appear in one of the yi) 
intervention on T: we may suppose any auxiliary computations T might require 
use only memory tape squares with indices past N. After branching into the ith 
block, the behavior of T is exactly the same as that of Thon-prob,i, meaning that 
any random bit tape fixings that end up causing a branch into this block will be- 
long to S(y;). Another random bit tape fixing that causes a branch into another 
block, say the jth, cannot belong to S(y;) since y;,y; are jointly unsatisfiable. 
Thus, P+(¢;) = p; for all i. 


Finally, we must see that this model lies in M+ if the original formula is con- 
sistent with AX‘. [6] has shown that Saran [a] — (a)8. Then in the proof 
of Lemma 5, we may suppose that each Tyon-prob,i block contains only always- 
halting code,’ and hence that T does not contain any loops either: thus it almost- 


surely halts. 


8 Why? Since y; are satisfiable, they are consistent with the axiomatization for non- 
probabilistic simulation models given by [6], and hence are satisfied by the canonical 
models given in [6]. These models use only boundedly many tape squares. 

° Since the canonical programs of [6] for ME secs contain only such code. 


D. Ibeling 


4 Computational Complexity 


Call the problem of deciding if a formula y € £ is satisfiable PROB-SIM-SAT(y). 
Theorem 2 shows that solving this problem is no more complex than is proposi- 
tional satisfiability. 


Theorem 2. PROB-SIM-SAT(y) is NP-complete in |p| (where this length is 
computed standardly). 


Proof. It’s NP-hard since, given any propositional 7, the formula P(r) > 0 is 
satisfiable iff m is satisfiable (consider a machine that does nothing but write a 
satisfying memory tape assignment out). In order to show that the satisfiability 
problem is in NP, we give the following nondeterministic satisfiability algorithm: 
guess a program from a class of programs (that we will define shortly) that 
includes the program constructed in Lemma 5 —call this canonical program 
T,—and check (in polynomial time) if it satisfies y. This algorithm decides sat- 
isfiability since, by soundness, a satisfiable formula must be consistent, and hence 
has a canonical model of the form constructed in Lemma 5. For the remainder of 
the proof, by the “length of a number,” we just mean the length of its computer 
(binary) representation. The “length of a rational” is the sum of the lengths of 
its numerator and its denominator. 

What is the class of probabilistic simulation models that we may limit our 
guesses to? For some fixed constants C, D € N, we will define a class Mọ c,D- 
We will then show that there exist C,D such that the canonical program of 
Lemma 5 belongs to M,y,.c,p for all consistent y. Let My.c.p be the fragment 
of probabilistic simulation models whose code consists of the following: 


1. Code to draw a random number uniformly between 1 and some N, such that 
N has length at most D]y|°. 

2. At most n = C|y| branches, that is, copies of: an if-statement with condition 
L< r <u, whose body is a canonical program Py, for some Yi E Lnon-prob; 
of the same form as the non-probabilistic canonical models (i.e., in the class 
defined in the proof of Theorem 2 from [6}). 


Letting 4;, u; be the bounds for the ith copy in (2), we also require that 4 = 1, 
and that €;4; = u;+1 for all i, and that un = N. The following fact from linear 
algebra (we refer the reader to [2] for the proof) helps us to show that for all 
consistent p, the canonical program T, belongs to My,c,p for some C, D. 


Lemma 6. A system ofm linear inequalities with integer coefficients of length at 
most L that has a nonnegative solution has a nonnegative solution with at most m 
variables nonzero, and where the variables have length at most O(mél+m log m). 


Apply this lemma to (2). Each inequality in (2) originally came from y, so there 
are O(|y|) of them. Further, recall that each integer coefficient in (2) came from 
summing up a subset of 2” coefficients originally from vy, with n is the number of 
atoms appearing anywhere inside P expressions in y. As this n is thus O(|y|)— 
and hence 2” is O(|y|) in length—and each original coefficient is also O(|y|) 
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in length, each coefficient is O(|y|) in length as well (lengths of products add). 
Thus Lemma 6 shows that without loss of generality, we may suppose that the 
solutions for the P(d;) of (2) have O(|y|?) length. The common denominator of 
these O(|y|) rationals hence has O(|y|°) length. The construction of Lemma 5 
has one branch for each of them, and hence O(|y|) branches. This shows the 
existence of D for part (1) of the definition of My .c,p and the existence of a 
C for part (2). We will abbreviate M, = My,c,p for some choice of C, D thus 
guaranteed. 

It remains to show that given any program T € My, we can check if T = y 
in polynomial time. It suffices to show that checking if T = w for Y © Lineq 
is polynomial time: if we know whether T |} w for every w that ọ is built 
out of, we can decide in linear time if T H y. Thus suppose w has the form 
a,P(y1) +--+: + a,P(Yn) < c. [6] shows that one may check if the Py, in part 
(2) of the definition of M, satisfy any formula of the basic language Lnon-prob 
in polynomial time. Then we can easily compute P(y;) as simply the sum of the 
probabilities of each branch that satisfies y;. Doing the arithmetic to check if w 
is satisfied is then certainly polynomial time, so we have our result. 


5 Conclusion and Future Work 


We have defined and obtained foundational results concerning a very natural 
extension of counterfactual intervention on simulation models to the probabilistic 
case. 

One critical operation in probability is conditioning, or updating probabili- 
ties given that some event is known to have occurred (in the subjective inter- 
pretation, updating a belief for known information). One may already define 
conditional probabilities in the usual way in the current framework, and our 
framework (without interventions) covers the conditional simulation approach 
to certain aspects of common-sense reasoning of [3]. In this approach, one limits 
oneself to the runs satisfying a certain query; the framework considered here 
would be equivalent for any queries expressible as formulas of Lnon-prob- [2] also 
give a logic for reasoning about conditional probabilities. Future work would 
involve extending this system to probabilistic simulation models and studying 
the complexity of reasoning in that setting. 

As [8,6] note, the simulation model approach invalidates many important 
logical principles that are valid in other approaches [5, 11,9], such as cautious 
monotonicity: [A](B AC) — [AA B]C. However the approach is otherwise quite 
general, and an important future direction would be to identify and characterize 
subclasses of of simulation models that validate this and other similar logical 
principles. We have begun investigating this extension. An interesting conse- 
quence it has is on the comparison of conditional probability with the probabil- 
ities of subjunctive conditionals: while these two probabilities are not in general 
equal in the classes M or M+, they are equal in certain restricted classes. 

A final direction we want to mention concerns “open-world” reasoning includ- 
ing first-order reasoning about models with some domain, where counterfactual 
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antecedents might alter how many individuals are being considered or which in- 
dividuals fall under a property or bear certain relations to each other. Recursion 
and the tools of logic programming [4, 10] make this very natural for the simula- 
tion model approach, and we would like to understand the first- and higher-order 
conditional logics that result in this approach, in both the non-probabilistic and 
probabilistic cases. We have also begun exploring this direction. 
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